Profile-Profile Alignment: A Powerful Tool for Protein Structure Prediction

نویسندگان

  • Niklas von Öhsen
  • Ingolf Sommer
  • Ralf Zimmer
چکیده

The problem of computing the tertiary structure of a protein from a given amino acid sequence has been a major subject of bioinformatics research during the last decade. Many different approaches have been taken to tackle the problem, the most successful of which are based on searching databases to identify a similar amino acid sequence in the PDB and using the corresponding structure as a template for modeling the structure of the query sequence. An important advance for the evaluation of sequence similarity in this context has been the use of a frequency profile that represents a part of the protein sequence space close to the query sequence instead of the query sequence itself. In this paper, we present a further extension of this principle by using profiles instead of the template sequences, also. We show that, by using our newly developed scoring model, the profile-profile alignment approach is able to significantly outperform current state of the art methods like PSI-BLAST, HMMs, or threading methods in a fold recognition setup. This is especially interesting since we show that it holds for closely related sequences as well as for very distantly related ones.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A comparative assessment and analysis of 20 representative sequence alignment methods for protein structure prediction

Protein sequence alignment is essential for template-based protein structure prediction and function annotation. We collect 20 sequence alignment algorithms, 10 published and 10 newly developed, which cover all representative sequence- and profile-based alignment approaches. These algorithms are benchmarked on 538 non-redundant proteins for protein fold-recognition on a uniform template library...

متن کامل

Length Encoded Secondary Structure Profile for Remote Homologous Protein Detection

Protein data has an explosive increasing rate both in volume and diversity, yet many of its structures remain unresolved, as well their functions remain to be identified. The conventional sequence alignment tools are insufficient in remote homology detection, while the current structural alignment tools would encounter the difficulties for proteins of unresolved structure. Here, we aimed to ove...

متن کامل

A generalization of Profile Hidden Markov Model (PHMM) using one-by-one dependency between sequences

The Profile Hidden Markov Model (PHMM) can be poor at capturing dependency between observations because of the statistical assumptions it makes. To overcome this limitation, the dependency between residues in a multiple sequence alignment (MSA) which is the representative of a PHMM can be combined with the PHMM. Based on the fact that sequences appearing in the final MSA are written based on th...

متن کامل

Protein homology detection by HMM?CHMM comparison

MOTIVATION Protein homology detection and sequence alignment are at the basis of protein structure prediction, function prediction and evolution. RESULTS We have generalized the alignment of protein sequences with a profile hidden Markov model (HMM) to the case of pairwise alignment of profile HMMs. We present a method for detecting distant homologous relationships between proteins based on t...

متن کامل

Template‐based quaternary structure prediction of proteins using enhanced profile–profile alignments

Proteins often exist as their multimeric forms when they function as so-called biological assemblies consisting of the specific number and arrangement of protein subunits. Consequently, elucidating biological assemblies is necessary to improve understanding of protein function. Template-Based Modeling (TBM), based on known protein structures, has been used widely for protein structure predictio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing

دوره   شماره 

صفحات  -

تاریخ انتشار 2003